Picture for Min Zhang

Min Zhang

Jake

CASTLE: A Comprehensive Benchmark for Evaluating Student-Tailored Personalized Safety in Large Language Models

Add code
Feb 05, 2026
Viaarxiv icon

PACE: Defying the Scaling Hypothesis of Exploration in Iterative Alignment for Mathematical Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models

Add code
Feb 05, 2026
Viaarxiv icon

LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding

Add code
Feb 04, 2026
Viaarxiv icon

Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition

Add code
Feb 04, 2026
Viaarxiv icon

Decoupling Skeleton and Flesh: Efficient Multimodal Table Reasoning with Disentangled Alignment and Structure-aware Guidance

Add code
Feb 03, 2026
Viaarxiv icon

Instruction Anchors: Dissecting the Causal Dynamics of Modality Arbitration

Add code
Feb 03, 2026
Viaarxiv icon

CVeDRL: An Efficient Code Verifier via Difficulty-aware Reinforcement Learning

Add code
Jan 30, 2026
Viaarxiv icon

Reversible Diffusion Decoding for Diffusion Language Models

Add code
Jan 29, 2026
Viaarxiv icon

VC-Bench: Pioneering the Video Connecting Benchmark with a Dataset and Evaluation Metrics

Add code
Jan 27, 2026
Viaarxiv icon